IRIT at INEX 2013: Tweet Contextualization Track

نویسندگان

Liana Ermakova

Josiane Mothe

چکیده

The paper presents IRIT’s approach used at INEX Tweet Contextualization Track 2013. Systems had to provide a context to a tweet. This year we further modified our approach presented at INEX 2011 and 2012 underlain by the product of scores based on hashtag processing, TF-IDF cosine similarity measure enriched by smoothing from local context and document beginning, named entity recognition and part-of-speech weighting. We assumed that relevant sentences come from relevant documents therefore we multiply sentence score by document relevance. We also used generalized POS (e.g. we merge regular adverbs, superlative and comparative into a single adverb group). We introduced sentence quality measure based on Flesch reading ease test, lexical diversity, meaningful word ratio and punctuation ratio. Our approach was ranked first, second and third over 24 runs submitted by all participants on different reference pools according to informativeness evaluation. At the same time it obtained the best readability score.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Automatic Greedy Summarization System at INEX 2013 Tweet Contextualization Track

According to the organizers, the aim of the 2013 INEX Tweet Contextualization Track is: “...given a tweet, the system must provide some context about the subject of the tweet, in order to help the reader to understand it. This context should take the form of a readable (and short) summary, composed of passages from [...] Wikipedia.” We present an automatic greedy summarizer named REG applied to...

متن کامل

Ultra-stemming and Statistical Summarization at INEX 2013 Tweet Contextualization Track

According to the organizers, the objective of the 2013 INEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present the Cortex summarizer applied to the INEX 2...

متن کامل

Three Statistical Summarizers at CLEF-INEX 2013 Tweet Contextualization Track

According to the organizers, the objective of the 2014 CLEFINEX Tweet Contextualization Task is: “...The Tweet Contextualization aims at providing automatically information a summary that explains the tweet. This requires combining multiple types of processing from information retrieval to multi-document summarization including entity linking.” We present three statistical summarizer systems ap...

متن کامل

IRIT at INEX: Question Answering Task

In this paper we describe an approach to tweet contextualization developed in the context of INEX QA track. The task is to provide a context up to 500 words to a tweet. The summary should be an extract from the Wikipedia. Our approach is based on the index which includes not only lemmas, but also named entities. Sentence retrieval is based on standard TF-IDF measure enriched by named entity rec...

متن کامل

IRIT at INEX 2012: Tweet Contextualization

In this paper, we describe an approach for tweet contextualization developed in the context of the INEX 2012. The task was to provide a context up to 500 words to a tweet from the Wikipedia. As a baseline system, we used TF-IDF cosine similarity measure enriched by smoothing from local context, named entity recognition and part-of-speech weighting presented at INEX 2011. We modified this method...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

IRIT at INEX 2013: Tweet Contextualization Track

نویسندگان

چکیده

منابع مشابه

An Automatic Greedy Summarization System at INEX 2013 Tweet Contextualization Track

Ultra-stemming and Statistical Summarization at INEX 2013 Tweet Contextualization Track

Three Statistical Summarizers at CLEF-INEX 2013 Tweet Contextualization Track

IRIT at INEX: Question Answering Task

IRIT at INEX 2012: Tweet Contextualization

عنوان ژورنال:

اشتراک گذاری